Analysis of Legal Documents via Non-negative Matrix Factorization Methods

نویسندگان

چکیده

The California Innocence Project (CIP), a clinical law school program aiming to free wrongfully convicted prisoners, evaluates thousands of mails containing new requests for assistance and corresponding case files. Processing interpreting this large amount information presents significant challenge CIP officials, which can be successfully aided by topic modeling techniques.In paper, we apply Non-negative Matrix Factorization (NMF) method implement various offshoots it the important previously unstudied data set compiled CIP. We identify underlying topics existing files classify request crime type status (decision type). results uncover semantic structure current provide officials with general understanding newly received before further examinations. also an exposition popular variants NMF their experimental discuss benefits drawbacks each variant through real-world application.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Non-negative Matrix Factorization via Archetypal Analysis

Given a collection of data points, non-negative matrix factorization (NMF) suggests to express them as convex combinations of a small set of ‘archetypes’ with non-negative entries. This decomposition is unique only if the true archetypes are non-negative and sufficiently sparse (or the weights are sufficiently sparse), a regime that is captured by the separability condition and its generalizati...

متن کامل

Ensemble Non-negative Matrix Factorization for Clustering Biomedical Documents

Searching and mining biomedical literature database, such as MEDLINE, is the main source of generating scientific hypothesis for biomedical researchers. Through grouping similar documents together, clustering techniques can facilitate user’s need of effectively finding interested documents. Since non-negative matrix factorization (NMF) can effectively capture the latent semantic space with non-...

متن کامل

Recovery Guarantee of Non-negative Matrix Factorization via Alternating Updates

Non-negative matrix factorization is a popular tool for decomposing data into feature and weight matrices under non-negativity constraints. It enjoys practical success but is poorly understood theoretically. This paper proposes an algorithm that alternates between decoding the weights and updating the features, and shows that assuming a generative model of the data, it provably recovers the gro...

متن کامل

Bayesian Non-negative Matrix Factorization

We present a Bayesian treatment of non-negative matrix factorization (NMF), based on a normal likelihood and exponential priors, and derive an efficient Gibbs sampler to approximate the posterior density of the NMF factors. On a chemical brain imaging data set, we show that this improves interpretability by providing uncertainty estimates. We discuss how the Gibbs sampler can be used for model ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: SIAM undergraduate research online

سال: 2022

ISSN: ['2327-7807']

DOI: https://doi.org/10.1137/21s1414486